feat: add schema isolation infrastructure for parallel Jest workers #3425

idoshamun · 2026-01-09T21:02:32Z

Summary

Add optional schema-per-worker isolation infrastructure for test parallelization (ENG-283)
Add TYPEORM_SCHEMA and ENABLE_SCHEMA_ISOLATION environment variable support in data-source.ts
Add createWorkerSchema() to create isolated schemas by copying table structures and views
Update cleanDatabase() to use schema-aware queries

How it works

When ENABLE_SCHEMA_ISOLATION=true is set along with JEST_WORKER_ID, each Jest worker gets its own PostgreSQL schema (test_worker_1, test_worker_2, etc.). The setup:

Creates the worker schema
Copies all table structures from public schema using LIKE ... INCLUDING ALL
Copies views with schema references updated
Uses schema-prefixed queries for database cleanup

Current state

The infrastructure is in place but not enabled in CI. Enabling it requires setting ENABLE_SCHEMA_ISOLATION=true and changing --runInBand to --maxWorkers=4.

Known limitation

68/73 boot tests pass with schema isolation, but 5 tests fail due to raw SQL queries in auth/boot code that don't use the schema prefix. Full enablement (ENG-284) is blocked until these are addressed.

Test plan

All tests pass without schema isolation (NODE_ENV=test npx jest --runInBand)
Lint passes
93% of boot tests pass with schema isolation (68/73)
Full test suite in CI

ENG-283

pulumi · 2026-01-09T21:04:56Z

🍹 The Update (preview) for dailydotdev/api/prod (at 20dd9b8) was successful.

Resource Changes

    Name                                                   Type                                  Operation
+-  vpc-native-k8s-secret                                  kubernetes:core/v1:Secret             create-replacement
~   vpc-native-generate-search-invites-cron                kubernetes:batch/v1:CronJob           update
~   vpc-native-user-profile-updated-sync-cron              kubernetes:batch/v1:CronJob           update
~   vpc-native-update-tag-recommendations-cron             kubernetes:batch/v1:CronJob           update
~   vpc-native-validate-active-users-cron                  kubernetes:batch/v1:CronJob           update
~   vpc-native-personalized-digest-deployment              kubernetes:apps/v1:Deployment         update
~   vpc-native-hourly-notification-cron                    kubernetes:batch/v1:CronJob           update
~   vpc-native-clean-zombie-opportunities-cron             kubernetes:batch/v1:CronJob           update
-   vpc-native-api-clickhouse-migration-ffae6b22           kubernetes:batch/v1:Job               delete
~   vpc-native-clean-zombie-users-cron                     kubernetes:batch/v1:CronJob           update
~   vpc-native-update-source-public-threshold-cron         kubernetes:batch/v1:CronJob           update
~   vpc-native-clean-zombie-images-cron                    kubernetes:batch/v1:CronJob           update
~   vpc-native-ws-deployment                               kubernetes:apps/v1:Deployment         update
~   vpc-native-update-current-streak-cron                  kubernetes:batch/v1:CronJob           update
~   vpc-native-sync-subscription-with-cio-cron             kubernetes:batch/v1:CronJob           update
~   vpc-native-private-deployment                          kubernetes:apps/v1:Deployment         update
~   vpc-native-clean-gifted-plus-cron                      kubernetes:batch/v1:CronJob           update
~   vpc-native-clean-stale-user-transactions-cron          kubernetes:batch/v1:CronJob           update
~   vpc-native-deployment                                  kubernetes:apps/v1:Deployment         update
~   vpc-native-post-analytics-history-day-clickhouse-cron  kubernetes:batch/v1:CronJob           update
~   vpc-native-generic-referral-reminder-cron              kubernetes:batch/v1:CronJob           update
~   vpc-native-check-analytics-report-cron                 kubernetes:batch/v1:CronJob           update
~   vpc-native-calculate-top-readers-cron                  kubernetes:batch/v1:CronJob           update
-   vpc-native-api-db-migration-ffae6b22                   kubernetes:batch/v1:Job               delete
~   vpc-native-update-source-tag-view-cron                 kubernetes:batch/v1:CronJob           update
+   vpc-native-api-clickhouse-migration-069c4260           kubernetes:batch/v1:Job               create
~   vpc-native-update-tags-str-cron                        kubernetes:batch/v1:CronJob           update
-   api-sub-api.parse-opportunity-feedback                 gcp:pubsub/subscription:Subscription  delete
~   vpc-native-update-highlighted-views-cron               kubernetes:batch/v1:CronJob           update
~   vpc-native-daily-digest-cron                           kubernetes:batch/v1:CronJob           update
~   vpc-native-temporal-deployment                         kubernetes:apps/v1:Deployment         update
~   vpc-native-post-analytics-clickhouse-cron              kubernetes:batch/v1:CronJob           update
+   vpc-native-api-db-migration-069c4260                   kubernetes:batch/v1:Job               create
~   vpc-native-update-views-cron                           kubernetes:batch/v1:CronJob           update
~   vpc-native-update-trending-cron                        kubernetes:batch/v1:CronJob           update
~   vpc-native-clean-zombie-user-companies-cron            kubernetes:batch/v1:CronJob           update
~   vpc-native-personalized-digest-cron                    kubernetes:batch/v1:CronJob           update
~   vpc-native-bg-deployment                               kubernetes:apps/v1:Deployment         update

Add infrastructure for PostgreSQL schema isolation to enable parallel Jest workers within CI jobs. Each worker gets its own schema to prevent data conflicts between tests. Changes: - Add TYPEORM_SCHEMA env var support and auto-schema selection based on JEST_WORKER_ID when ENABLE_SCHEMA_ISOLATION=true - Set PostgreSQL search_path at connection level for raw SQL queries - Add createWorkerSchema() to copy table structures, views, and migrations data from public schema to worker schemas - Use pg_get_serial_sequence() for sequence resets to handle different sequence naming conventions Known limitation: Database triggers are not copied as they reference functions in the public schema. Schema isolation is opt-in via ENABLE_SCHEMA_ISOLATION=true environment variable. Addresses ENG-283

Enable parallel test execution within CI jobs by giving each Jest worker its own PostgreSQL schema. This significantly improves test throughput. Changes: - Update CircleCI to use --maxWorkers=4 with ENABLE_SCHEMA_ISOLATION=true - Add test:parallel npm script for local parallel test execution - Enhance createWorkerSchema() to copy: - Table structures (LIKE ... INCLUDING ALL) - Views with schema references updated - Materialized views with schema references updated - All user-defined functions with schema references updated - Triggers with schema and function references updated The schema isolation copies all database objects from public schema to worker-specific schemas (test_worker_1, test_worker_2, etc.), allowing tests to run in parallel without data conflicts. Addresses ENG-284

Fixes several issues with PostgreSQL schema isolation for parallel Jest workers: 1. FK constraint copying: Tables copied with INCLUDING ALL don't include FK constraints. Now explicitly copy FK constraints with correct schema references so CASCADE and SET NULL actions work properly. 2. Seed data copying: Copy critical seed data (ghost user '404', system user, system sources, etc.) to worker schemas so tests don't fail when expecting these records. 3. Trigger function search_path: Add SET search_path clause to plpgsql functions so unqualified table names in trigger bodies resolve to the correct worker schema instead of defaulting to public. 4. Hardcoded schema references: Remove explicit 'public.' references from cron jobs (updateViews, updateDiscussionScore, checkReferralReminder) so they work with schema isolation. 5. Increased beforeAll timeout to 60s to accommodate FK constraint copying. Test results with schema isolation: 180/198 test suites pass (3785/3916 tests).

Prevent deletion of predefined seed/reference data tables during test cleanup to maintain test stability and ensure critical data remains intact.

When CREATE TABLE ... LIKE ... INCLUDING ALL copies tables, column defaults still reference the original public schema sequences. This caused FK constraint violations when tests used TypeORM's save() with @PrimaryGeneratedColumn('increment') - the database used the wrong sequence position instead of starting at 1. Changes: - Create new sequences in worker schemas and update column defaults - Remove seed data copying for tables where tests create own fixtures (advanced_settings, source_category, prompt) - Use schema-qualified table names in sequence reset logic

…ions PostgreSQL's pg_matviews.definition returns normalized SQL where table names appear unqualified, but internally retains OID references to the original tables. Simply setting search_path before CREATE VIEW didn't work - views still bound to public schema tables. Solution: Explicitly replace all FROM/JOIN table references with schema-qualified versions using regex patterns. This handles: - FROM tablename - JOIN tablename - FROM (tablename alias - PostgreSQL's parenthesized JOIN format This fixes materialized views like trending_post, trending_tag, and tag_recommendation to correctly query worker schema tables instead of public schema tables. Test results: tags.ts now passes 15/15 (was 9/15 before fix)

Replace the approach of copying schema structure with running actual migrations for worker schemas. This ensures exact parity with how the schema was built. Key changes: - Add replaceSchemaReferences() to transform public schema refs in migrations - Add wrapQueryRunner() to intercept SQL queries during migration execution - Fix migration ordering to use 13-digit timestamp extraction - Reduce pool size to 10 for tests to avoid connection exhaustion - Replace flushall() with targeted deleteKeysByPattern() in boot.ts - Skip pub/sub test in parallel mode (channels can't be worker-isolated) Results: 197/198 test suites pass consistently with 2 parallel workers

…iency - Create __tests__/globalSetup.ts to run migrations once before all workers - Remove dead createWorkerSchema code from setup.ts (now in globalSetup) - Add globalSetup to jest.config.js This prevents each Jest worker from running migrations independently, reducing memory usage and avoiding SIGKILL/OOM issues in CI.

- Replace iterative DELETE + sequence reset with single TRUNCATE CASCADE - Increase timeout for 500-iteration rate limit test to 60s TRUNCATE is faster and RESTART IDENTITY handles sequence reset automatically.

- Change resource_class from large (8GB) to xlarge (16GB) - Remove CircleCI parallelism (1 container instead of 6) - Keep Jest maxWorkers=4 for parallel test execution - Increase NODE_OPTIONS memory to 12GB (75% of 16GB)

idoshamun force-pushed the eng-283-schema-isolation-infrastructure branch from 82b36ef to fe26d6a Compare January 9, 2026 21:20

idoshamun added 18 commits January 9, 2026 23:39

feat: exclude seed data tables from deletion in tests

c5d1b0c

Prevent deletion of predefined seed/reference data tables during test cleanup to maintain test stability and ensure critical data remains intact.

fix: lint issue

0ab7416

fix: lint formatting

0ced0b3

fix: add 30s timeout to beforeEach hook for CI

1eb78e5

fix: reduce connection pool sizes to prevent OOM on CI

c2eb2a4

fix: add global testTimeout of 30s for CI stability

f237ec0

perf: optimize cleanDatabase with single TRUNCATE statement

8ef7bc8

- Replace iterative DELETE + sequence reset with single TRUNCATE CASCADE - Increase timeout for 500-iteration rate limit test to 60s TRUNCATE is faster and RESTART IDENTITY handles sequence reset automatically.

refactor: revert clean database logic

911bc5d

fix: add jest.setTimeout(30000) for reliable global timeout

80cd735

ci: use xlarge resource and remove parallelism to fix OOM

4f88f04

- Change resource_class from large (8GB) to xlarge (16GB) - Remove CircleCI parallelism (1 container instead of 6) - Keep Jest maxWorkers=4 for parallel test execution - Increase NODE_OPTIONS memory to 12GB (75% of 16GB)

fix: clear rate limit key before each rate limiting test

23fe501

ci: add sequential test step for Redis pub/sub tests

20dd9b8

idoshamun force-pushed the eng-283-schema-isolation-infrastructure branch from 011de8f to 20dd9b8 Compare January 14, 2026 11:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add schema isolation infrastructure for parallel Jest workers #3425

feat: add schema isolation infrastructure for parallel Jest workers #3425

Uh oh!

idoshamun commented Jan 9, 2026

Uh oh!

pulumi bot commented Jan 9, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: add schema isolation infrastructure for parallel Jest workers #3425

Are you sure you want to change the base?

feat: add schema isolation infrastructure for parallel Jest workers #3425

Uh oh!

Conversation

idoshamun commented Jan 9, 2026

Summary

How it works

Current state

Known limitation

Test plan

Uh oh!

pulumi bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Resource Changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pulumi bot commented Jan 9, 2026 •

edited

Loading